Constructing a User Preference Ontology for Anti-spam Mail Systems
نویسندگان
چکیده
The judgment that whether an email is spam or non-spam may vary from person to person. Different individuals can have totally different responses to the same email based on their preferences. This paper presents an innovative approach that incorporates user preferences to construct an anti-spam mail system, which is different from the conventional content-based approaches. We build a user preference ontology to formally represent the important concepts and rules derived from a data mining process. Then we use an inference engine that utilizes the knowledge to predict the user’s action on new incoming emails. We also suggest a new rule optimization procedure inspired from logic synthesis to improve comprehensibility and exclude redundant rules. Experimental results showed that our user preference based architecture achieved good performance and the rules derived from the architecture and the optimization method have better quality in terms of comprehensibility.
منابع مشابه
Personalized Spam Filtering for Gray Mail
Gray mail, messages that could reasonably be considered either spam or good by different email users, is a commonly observed issue in production spam filtering systems. In this paper we study this class of mail using a large real-world email corpus and signaturebased campaign detection techniques. Our analysis shows that even an optimal filter will inevitably perform unsatisfactorily on gray ma...
متن کاملAn E-mail Authentication and Disposable Addressing Scheme for Filtering Spam
The number of spam mails has spread rapidly in recent years. Currently, the most common spam filtering solutions include blacklisting and content filtering, as well as the Bayesian approach, which uses a Bayesian filter to analyze mail content to generate classifiers. However, spammers can forge their addresses or include additional information that will mislead the filtering system or mark leg...
متن کاملA Machine Learning Approach to Server-side
Spam-detection systems based on traditional methods have several obvious disadvantages like low detection rate, necessity of regular knowledge bases’ updates, impersonal filtering rules. New intelligent methods for spam detection, which use statistical and machine learning algorithms, solve these problems successfully. But these methods are not widespread in spam filtering for enterprise-level ...
متن کاملEnterprise Anti-Spam Solution Based on Machine Learning Approach
Spam-detection systems based on traditional methods have several obvious disadvantages like low detection rate, necessity of regular knowledge bases’ updates, impersonal filtering rules. New intelligent methods for spam detection, which use statistical and machine learning algorithms, solve these problems successfully. But these methods are not widespread in spam filtering for enterprise-level ...
متن کاملUsing visual and semantic features for anti-spam filters
It is well known that Unsolicited Commercial Emails (UCE), commonly known as spam, are becoming a serious problem for email accounts of single users, small companies and large institutions. The presence of spam can seriously compromise normal user activities, forcing to navigate through mailboxes to find the relatively few interesting emails, so wasting time and bandwidth, occupying their stora...
متن کامل